Navigation Retrieval with Site Anchor Text

Authors

  • Hideki Kawai
  • Kenji Tateishi
  • Toshikazu Fukushima
Abstract

In this paper we present an information retrieval system that indexes only site anchor text, in order to verify the efficiency of reference information in a navigation retrieval task. We propose two relevancy measures that make the most of this limited information: reference consistency and specificity of word combination. Our results show that navigation retrieval with site anchor text can pinpoint highly relevant documents while using only one-thousandth as much information as traditional full-text search systems.
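
To make the idea of an anchor-text-only index concrete, the following is a minimal sketch and not the authors' system. It assumes that a "site" can be approximated by the host part of a link's target URL, and it ranks sites by a plain count of matching anchors. The abstract names reference consistency and specificity of word combination as its relevancy measures but does not define them, so they are not implemented here. The names `site_of`, `build_site_anchor_index`, and `search`, and the sample links, are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def site_of(url: str) -> str:
    """Collapse a URL to its host (an assumed stand-in for 'site')."""
    return urlsplit(url).netloc.lower()


def build_site_anchor_index(links):
    """Map each term to {site: number of anchors to that site containing it}.

    `links` is an iterable of (target_url, anchor_text) pairs. Page bodies
    are never read; only the anchor text of incoming links is indexed.
    """
    index = defaultdict(lambda: defaultdict(int))
    for target_url, anchor_text in links:
        site = site_of(target_url)
        # Count each term at most once per anchor.
        for term in set(anchor_text.lower().split()):
            index[term][site] += 1
    return index


def search(index, query):
    """Return sites matching every query term, best first.

    The score is a simple sum of anchor counts (an assumption made for
    illustration, not one of the paper's measures). Ties break by name.
    """
    terms = query.lower().split()
    if not terms:
        return []
    candidates = set(index.get(terms[0], {}))
    for term in terms[1:]:
        candidates &= set(index.get(term, {}))
    scored = [(sum(index[t][s] for t in terms), s) for s in candidates]
    return [s for _, s in sorted(scored, key=lambda pair: (-pair[0], pair[1]))]


# Hypothetical sample data: (target URL, anchor text of a link to it).
links = [
    ("http://www.example.com/", "Example Corp"),
    ("http://www.example.com/products", "Example Corp products"),
    ("http://news.example.org/story", "story about Example Corp"),
]
index = build_site_anchor_index(links)
ranking = search(index, "example corp")
```

In this sketch the index holds only a few words per link, which gives a feel for why such an index can be far smaller than a full-text one.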

Similar resources

Evaluation of Web Retrieval Methods Using Anchor Text

In this paper, we evaluate two types of anchor text: page anchors and site anchors. Because anchor text tends to summarize the content it links to, the terms appearing in it can be expected to carry important meaning for information retrieval. We introduce a retrieval method that gives high priority to terms in the anchor text. In the experiment, we compared the proposed method with...

Verification of Effective Retrieval Method for Anchor Text on Navigational Retrieval

We participated in the NTCIR-5 WEB Navigational Retrieval Subtask (Navi-2) to verify the most effective retrieval method for an index of anchor texts, using a retrieval system that indexed only anchor texts instead of the full texts of Web pages. We introduced retrieval methods that combine one or more of six retrieval measures: (a) anchor frequency (af), (b) reference consistency (rc), (c) ...

AT&T at TREC-9

This year we come to TREC with a new retrieval system Tivra that we have implemented over the last year. Tivra is based on the vector space model, and is mainly designed to do large-scale web search with limited resources. We run Tivra on a cheap Linux box. It currently indexes around 14-15 gigabytes of web data per hour, and allows sub-second web searches for 2-3 word queries on a 700 MHz Pent...

Exploiting Anchor Text for the Navigational Web Retrieval at NTCIR-5

In the Navigational Retrieval Subtask 2 (Navi-2) of the NTCIR-5 WEB Task, a hypothetical user knows of a specific item (e.g., a product, company, or person) and needs to find one or more representative Web pages related to that item. This paper describes the system with which we participated in the Navi-2 subtask and reports its evaluation results. Our system uses three types of information obt...

NTCIR-5 WEB Navi-2 Experiments at Osaka Kyoiku University - Page, Anchor and Title Indexing, and In-link Count, Inter Page and Inter Site Link Analyses

This paper describes experimental results for WEB Navigational Retrieval Subtask 2 (WEB Navi-2). We built three gram-based indices: one for the text of the whole page, one for text in the title tag, and one for text in anchor tags. Because gram-based indices can index every string in the target text, words that are not found in dictionaries are indexed as well. We used words in TITLE tag of search topics ...

Publication date: 2004